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Reconstitution of signaling pathways involving single mammalian transmembrane receptors has not been 
accomplished in yeast cells. In this study, intact EGF receptor (EGFR) and a cell wall-anchored form of EGF 
were co-expressed on the yeast cell surface, which led to autophosphorylation of the EGFR in an 
EGF-dependent autocrine manner. After changing from EGF to a conformationally constrained peptide 
library, cells were fluorescently labeled with an anti-phospho-EGFR antibody. Each cell was subjected to an 
automated single-cell analysis and isolation system that analyzed the fluorescent intensity of each cell and 
automatically retrieved each cell with the highest fluorescence. In ~3.2 X 10 6 peptide library, we isolated six 
novel peptides with agonistic activity of the EGFR in human squamous carcinoma A431 cells. The 
combination of yeast cells expressing mammalian receptors, a cell wall-anchored peptide library, and an 
automated single-cell analysis and isolation system might facilitate a rational approach for de novo drug 
screening. 

Sensing of extracellular ligands (e.g., hormones, cytokines and growth factors) via receptors on the cell 
surface modulates intracellular signaling molecules and subsequently regulates gene expression. Analysis of 
receptor-mediated signal transduction allows delineation of signaling pathways involved in the onset of 
various diseases. Thus, it is important to find or design drugs with agonistic or antagonistic activity for target 
receptors. 

To date, seven transmembrane G-protein-coupled receptors (GPCRs) have been recognized as potential target 
molecules for drug discovery (e.g., propranolol as an antagonist of pj-adrenergic receptors, phenylephrine as an 
agonist of a 1- adrenergic receptor) 1,2 . Single transmembrane receptor tyrosine kinases (RTKs) are also potential 
therapeutic targets. After conventional screening for drugs that bind to RTKs on mammalian cells, molecular 
target drugs have been developed to modulate these receptors, particularly for the purpose of treating cancers 
(e.g., gefitinib as an antagonist of the epidermal growth factor receptor (EGFR) 3 , trastuzumab as an antagonist of 
HER2 (a member of EGFR family) 4 ). For drug discovery, it is indispensable to prepare a tremendous number of 
chemical compounds (10 4 ~ 10 5 species) to identify lead compounds that target receptors. Because the initial 
screening is usually undertaken by in vitro administration of each chemical compound to mammalian cells {e.g., 
Chinese hamster ovary cells over-expressing target receptors), it is laborious to evaluate compounds using a large 
number of cell cultures. Furthermore, endogenous regulation mechanisms of target receptors, such as redund- 
ancy, crosstalk and feedback 5 7 , may affect quantitative measurement of second messenger levels 810 . These 
situations led us to develop a new high-throughput screening system of drugs for target receptors without any 
interference from endogenous regulation mechanisms. 

Because Saccharomyces cerevisiae expresses two endogenous GPCRs, STE2 and STE3 11 , various mammalian 
GPCRs have been successfully expressed in S. cerevisiae (e.g., adrenergic, chemoattractant C5a, serotonin and 
somatostatin receptors) 12 . Some mammalian GPCRs activate the yeast pheromone-sensing pathway in a dose- 
dependent manner of each ligand 13,14 , suggesting that yeast is suitable for quantitative analyses of mammalian 
GPCRs and their ligands. For example, Klein et al. 15 succeeded in de novo screening of agonists for the human 
formyl peptide receptor like- 1 receptor (FPRL- 1 R) using yeast cells co-expressing FPRL- 1 R and a random peptide 
library. Yeast cells are also considered as suitable host cells for screening agonists and antagonists of RTKs, 
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because there is no tyrosine kinase activity in yeast cells 16 . When a 
constitutively active EGFR is expressed on the yeast cell surface, the 
EGFR autophosphorylates and recruits Grb2 and Sos near the yeast 
plasma membrane, followed by activation of the Ras signaling path- 
way 17 . However, EGF-dependent activation of the EGFR (i.e., autop- 
hosphorylation) has not been detected in yeast cells, probably 
because the yeast cell wall prevents EGF from accessing the ligand- 
binding domain of the EGFR. 

Here, we report that a cell wall-anchored form of EGF can access 
the EGFR expressed on the yeast cell surface, and EGF activates the 
EGFR in an autocrine manner. Furthermore, a cell wall- anchored 
form of a conformationally constrained peptide library 18 , instead of 
EGF, allowed us to develop a de novo high-throughput screening 
system of agonists for the EGFR. In this system, yeast cells were 
stained by indirect immunofluorescence with an anti-phospho- 
EGFR antibody, and then analyzed with an automated single-cell 
analysis and isolation system 1 ". Finally, in the —3.2 X 10 6 peptide 
library, we have identified six novel peptides that show agonistic 
activity of the EGFR in mammalian cells for a short period. 

Results 

EGF-dependent autophosphorylation of the EGFR in yeast cells. 

Because the yeast cell wall prevents extracellular macromolecules 
from accessing the plasma membrane 1314,17 , both EGF and the 
EGFR were concurrently displayed on the yeast cell surface to 
increase the local concentration of EGF for reconstitution of the 
EGF signaling pathway in yeast cells (Figure la). An N-terminal 
HA-epitope-tagged human EGF was expressed in the periplasm 
with the assistance of MFal-prepro peptide 20 and FL042 (cell 
wall-anchoring domain of FLOl 21 ) (Figure lb, top). A C-terminal 
V5-epitope-tagged human EGFR was expressed as a membrane- 
anchored form with the assistance of SUC2-signal peptide 22 
(Figure lb, bottom). Western blot analysis showed that HA-EGF- 
FL042 and EGFR-V5 were localized in cell wall and cytosolic 
fractions, respectively (Figure lc). Because the calculated molecular 
weight of mature HA-EGF-FL042 was 1 1,783, the faint 12 kDa band 
and broad >20 kDa bands (left panel) were considered as non- 
glycosylated and glycosylated forms of HA-EGF-FL042, respec- 
tively. In addition, the calculated molecular weight of mature 
EGFR-V5 was 133,901, and the broad 130 and 160 kDa bands 
(right panel) might have corresponded to non-glycosylated and 
glycosylated forms of EGFR-V5, respectively. The size of the latter 
band corresponded well with that of the EGFR expressed in 
mammalian cells. Indirect immunofluorescence analyses using 
confocal laser scanning microscopy (LSM) indicated that HA- 
EGF-FL042 was successfully localized on the yeast cell surface 
(Figure Id). After treatment with zymolyase, spheroplasts were 
also subjected to indirect immunofluorescence analyses that 
indicated that EGFR-V5 resided mainly in the yeast plasma 
membrane and partly in the cytoplasm (Figure le). 

Localization of HA-EGF-FL042 and EGFR-V5 on the yeast cell 
surface (see Figure Id and e) suggested that the EGFR was autopho- 
sphorylated by adjacent EGF in an autocrine manner. After incuba- 
tion in HEPES buffer (pH 7.0) for 1 h to facilitate an efficient 
interaction between the EGFR and EGF, the fractions of cytoplasm 
and cell wall of yeast cells were prepared, and then analyzed with 
anti-EGFR phosphotyrosine (pY) antibodies against pY-1068, pY- 
1148 and pY-1173 (Figure 2a). EGFR expressed in yeast cells was 
found to be phosphorylated by the co-expression of HA-EGF-FL042 
(lane 3), which was reduced by the treatment with tyrphostin 
AG1478 23 , a specific inhibitor for EGFR autophosphorylation (lane 
4). Next, yeast cells were treated with zymolyase, and spheroplasts 
were probed with the anti-EGFR pY antibodies. Fluorescence derived 
from each phosphotyrosine was observed in cell peripheries 
(Figure 2c-e, left panels). Flow cytometric analyses confirmed that 
spheroplasts harboring phosphorylated EGFRs were substantially 



increased in an EGF-dependent manner (Figure 2c-e, right panels), 
whereas cell surface display of human interleukin-5 (HA-IL5- 
FL042) (Figure 2b), a non EGFR-related protein, could not induce 
the phosphorylation of EGFR. Since anti-EGFR pY-1173 antibody 
showed good signal to noise ratio in western blot and flow cytometric 
analyses, we selected the antibody for further analyses. Taken 
together, cell surface engineering facilitated, for the first time, detec- 
tion of EGF-dependent autophosphorylation of the EGFR in yeast 
cells, which allowed us to develop a new system to analyze ligand 
interactions with various receptors including RTKs and GPCRs 
without any interference from other mammalian proteins. 

EGF-dependent homo-oligomerization of the EGFR in the yeast 
plasma membrane. As observed in mammalian cells 24 , autophos- 
phorylation of the EGFR upon EGF-binding suggested homo- 
oligomer formation of EGFRs in the yeast plasma membrane. A 
split-ubiquitin (ub) membrane two-hybrid assay 25 was carried out 
with the S. cerevisiae NMY51 strain harboring Lacz and HIS3 
reporter genes under the control of the LexA-driven promoter 
using bait EGFR fused with the C-terminal half of ub (Cub) and a 
LexA-VP16 hybrid transcription factor (DNA-binding domain of 
LexA and transactivating domain of herpes simplex virus VP 16) 
(EGFR-Cub-LexA-VP16), and prey EGFR fused with the N- 
terminal half of ub (Nub) (EGFR-Nub) (Supplementary Figure 
SI a). When ub was reconstituted upon oligomerization of the 
EGFR, the linkage between Cub and LexA was digested by 
endogenous ub C-terminal hydrolase, followed by release of LexA- 
VP16. By co-expression with HA-EGF-FL042, yeast cells could grow 
on His-depleted plates and turn blue on X-gal-containing plates 
(Supplementary Figure Sib). These results indicated that EGF 
displayed on the yeast cell surface remarkably induced homo- 
oligomerization of EGFRs in the yeast plasma membrane. 

Functional interaction of the EGFR with Grb2 and Shcl. By 

expressing constitutively active EGFR in yeast cells, autophos- 
phorylated EGFRs recruited exogenous Grb2, Shcl and Sos to the 
plasma membrane, followed by activation of the yeast RAS signaling 
pathway 17 . Therefore, we introduced HA-EGF-FL042 and EGFR-V5 
into a temperature-sensitive mutant of the S. cerevisiae cdc25h 
strain 26 co-expressing the Sos-fused form of Grb2 or Shcl (Supple- 
mentary Figure S2a). The cdc25h strain harbors temperature- 
sensitive CDC25 (CDC25 ls ) that functions as a GDP-GTP exchanging 
factor for RAS1 and RAS2, and induces growth inhibition at 
37°C. When the EGFR was activated by EGF, yeast cells sur- 
vived even at 37°C due to complementation of the function of 
CDC25 1S by Grb2-Sos or Shcl-Sos (Supplementary Figure S2b). 
In control yeast cells expressing FL042 instead of HA-EGF- 
FL042, almost no cells could grow at 37°C These results indicat- 
ed that the yeast RAS signaling pathway was effectively activated 
by EGF-dependent EGFR autophosphorylation. 

High-throughput de novo screening of EGFR agonists by an 
automated single-cell analysis and isolation system. Recently, we 
have developed an automated single-cell analysis and isolation 
system that isolates single cells of interest from arrays containing 
large number of cells 19 . Based on the functional interaction 
between EGF and the EGFR on the yeast cell surface, a conforma- 
tionally constrained peptide library, instead of HA-EGF-FL042, was 
introduced into yeast cells for de novo screening of EGFR agonists. 
The peptide library consisted of MFtxl-prepro peptide, helix-loop- 
helix (HLH), FLAG-epitope tag and FL042 from the N- to C- 
terminals (Figure 3a). The HLH moiety was composed of two 
helixes split by a hepta-glycine (G 7 )-loop, in which the C-terminal 
helix contained randomized sequences of five amino acids 
(approximately 3.2 X 10 6 variations) on its solvent-accessible 
surface 17 . Cells containing the peptide library were subjected to 
indirect immunofluorescence analysis with an anti-FLAG antibody 
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Figure 1 | Expression and subcellular localization of the EGF-FL042 and EGFR in yeast cells, (a) Schematic drawing ofthe EGFR and FL042 -fused EGF 
in the periplasm of yeast cells. Upon binding EGF, the EGFR autophosphorylates. (b) A cell wall-anchored form of HA-tagged EGF (HA-EGF-FL042) 
was expressed with the assistance of MFccl-prepro peptide. V5-tagged human EGFR (EGFR-V5) was expressed with the assistance of SUC2-signal 
peptide. TM, transmembrane domain. Numbers indicate amino acid residues at the borders of each domain, (c) Co-expression of HA-EGF-FL042 and 
EGFR-V5 in yeast cells. Cytoplasmic (Cyto) and cell-wall {Wall) fractions were analyzed by western blotting using anti-HA (HA) and anti-V5 ( V5) 
antibodies. Single and double asterisks indicate non-glycosylated and glycosylated forms, respectively, (d) Subcellular localization of HA-EGF-FL042 
in yeast cells. Intact cells were stained with anti-HA antibody. Bars = 2 um. (e) Subcellular localization of EGFR- V5 in yeast cells. Spheroplasts were 
stained with anti-V5 antibody. Bars = 2 um. 



and then observed by confocal LSM. Fluorescence was observed 
around the periphery of yeast cells (Figure 3b), similar to HA- 
EGF-FL042 (see Figure Id), indicating that the peptide library was 
efficiently displayed on the yeast cell surface. Meanwhile, based on 



the flow cytometric analyses, about 100% of yeast cells expressed 
both HLH-FLAG-FL042 library and EGFR-V5 concurrently, 
suggesting that randomized HLH peptide library could be screened 
thoroughly in yeast cell expressing EGFR-V5. 



SCIENTIFIC REPORTS | 4:4242 | DOI: 1 0. 1 038/srep04242 



3 



EGFR-V5 - + + + 
HA-EGF-FL042 - - + + 
AG1478 - - - + 



FLQ42 HA-EGF-FLQ42 



150- 



150- 



150- 



150- 



12- 

kDa 1 2 3 4 



EGFRpY1068 
EGFR pY1148 

EGFR pY1173 

EGFR-V5 
HA-EGF-FLQ42 



DIC 



EGFR 
pY1068 




Probe : a-EGFRpY1068 

— HA-IL5-FL042 

— FL042 

— HA-EGF-FL042 

— HA-EGF-FLC42 
(2nd Ab only) 



Alexa 488 



FL042 HA-EGF-FL042 



DIC 



EGFR 
pY1148 




J. 




e 



Probe :a-EGFRpY1148 

— HA-IL5-FL042 

— FLC42 

— HA-EGF-FL042 

— HA-EGF-FL042 

(2nd Ab only) 



10 3 
Alexa 488 



FL042 HA-IL5-FL042 



FL042 HA-EGF-FL042 



DIC 




DIC 



EGFR 
pY1173 




Probe :a-EGFRpY1173 

— HA-IL5-FLC42 

— FL042 

— HA-EGF-FL042 

— HA-EGF-FL042 
(2nd Ab only) 



10 3 
Alexa 488 



10 4 



Figure 2 | Autophosphorylation of EGFR- V5 induced by HA-EGF-FL042 in yeast cells, (a) EGF-dependent autophosphorylation of EGFR. Yeast cells 
co-expressing HA-EGF-FL042 and EGFR-V5 were lysed and separated to cytoplasmic and cell wall fractions. Cytoplasmic fraction was subjected to 
western blot analysis using anti-V5 and anti-EGFR antibodies against phosphotyrosine-1068 (pY-1068), pY-1148 and pY-1173. Expression of HA-EGF- 
FL042 in cell wall fraction was confirmed by using anti-HA antibody. Numbers shown at the bottom indicate lane number. Single and double asterisks 
indicate non-glycosylated and glycosylated forms, respectively, (b) Subcellular localization of HA-IL5-FL042 in yeast cells. Intact cells were stained with 
anti-HA antibody. Bar = 2 |rm. (c-e) Autophosphorylation of EGFR- V5 induced by HA-EGF-FL042 in yeast cells. Spheroplasts were stained with a 
specific antibody against pY-1068 (c), pY-1148 (d) and pY-1173 (e) in the EGFR, and then subjected to microscopic and flow cytometric analyses. HA- 
IL5-FL042 was used as control of non-specific ligand for EGFR. Bars = 2 um. 



Yeast cells co-expressing the peptide library and EGFR were cul- 
tured for 16 h, incubated in HEPES buffer (pH 7.0) for 1 h, fixed with 
4% PFA, treated with zymolyase, stained by indirect immunofluor- 
escence with an anti-EGFR pY-1173 antibody, and then placed into 
microchambers (a polystyrene slide containing 2.5 X 10 5 of 10-um 
wells) by gentle centrifugation. About 80 ~ 90% of wells contained 
one to two spheroplasts. The fluorescent intensity and position of 
each spheroplast were analyzed by the automated single-cell analysis 
and isolation system (Figure 3c). After identifying positive sphero- 
plasts with high fluorescent intensities, they were automatically 
retrieved with a glass capillary and individually transferred into the 
wells of a 96-well PCR plate. Finally, from approximately 5 X 10 6 
spheroplasts (3.2 X 10 6 independent clones), 83 positive spheroplasts 
were isolated and their fluorescent intensity was already sorted in 
descending order. 

Evaluation of EGFR agonist candidates in A431 cells. Using single 
cell-based PCR, 65 DNA fragments encoding the HLH moiety were 
successfully amplified from 83 positive spheroplasts (Figure 4a). 
Among 13 clones with high fluorescent intensities, DNA 
sequencing revealed that four (clones 2, 6, 9 and 10) and three 
clones (clones 8, 11 and 12) encoded identical amino acid 
sequences, respectively (Supplementary Tables SI and S2). Eight 
clones encoding distinct amino acid sequences were expressed as 



an N-terminal thioredoxin-fused C-terminal hexahistidine-tagged 
form in Escherichia coli BL21 (DE3) (Figure 4b), and fused 
proteins were purified with a cobalt-chelating column (Figure 4c). 
These proteins were added to the culture medium of human 
adenosquamous carcinoma A431 cells that overexpress the EGFR. 
Western blot analysis using an anti-EGFR antibody against pY-1173 
demonstrated that six proteins (derived from clones 2, 3, 5, 7, 8 and 
13) possessed an agonistic activity for the EGFR in mammalian cells 
(Figure 5a), which was estimated less than 0.1% of EGF (working 
concentration, 1.5 nM for EGF versus 2.0 uM for peptides) 
(Figure 5b). However, the agonistic activity of each peptide was 
completely inhibited by the treatment with tyrphostin AG 1478, 
corroborating that each peptide functions as an EGFR agonist. 

Discussion 

Here, we demonstrated that the EGFR expressed in the yeast plasma 
membrane could form homo-oligomers upon stimulation with 
cell wall-anchored EGF, followed by its autophosphorylation and 
activation of the yeast RAS signaling pathway. This is the first report 
of yeast cells that can reconstitute a mammalian growth-signaling 
pathway mediated by the EGF-EGFR-Grb2/Shcl-Sos-Ras complex. 
By replacing EGF with a random peptide library, we achieved 
high-throughput screening of EGFR agonists using an automated 
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Figure 3 | High-throughput de novo screening of EGFR agonists, (a) A 

cell wall-anchored form of FLAG-tagged HLH (HLH-FLAG-FL042) was 
expressed with the assistance of MFal-prepro peptide. The HLH consisted 
of two a-helixes (blue cylinders) linked with a hepta-glycine (G 7 ) loop, in 
which the second a-helix contained randomized sequences of five amino 
acids (red dots) on its solvent-accessible surface, (b) Subcellular 
localization of HLH-FLAG-FL042 in yeast cells. Intact cells were stained 
with an anti-FLAG antibody. Bar = 2.5 urn. (c) Flowchart of EGFR agonist 



screening. Approximately 3.2 X 10 6 cells co-expressing HLH-FLAG- 
FL042 (HLH) and EGFR-V5 (EGFR) were incubated at pH 7 for 1 h, fixed 
with paraformaldehyde and then treated with zymolyase, followed by 
incubation with an anti-phospho EGFR antibody. Approximately 2.5 X 
10 5 cells were individually placed into the microchambers of cell array chip 
by brief centrifugation. The chip was scanned with a CCD camera under 
UV light by an automated single-cell analysis and isolation system. In the 
microchamber array chip, positive and negative cells were indicated by red 
and green circles, respectively. Positive cells were automatically retrieved 
with a glass capillary equipped on the micromanipulator. Using 20 chips, 
83 positive cells were obtained, and 65 DNA fragments were amplified by 
single cell-based PCR. Amino acid sequences were deduced from 13 
fragments derived from cells with high fluorescence and then aligned. 
Finally, eight independent EGFR agonist candidates were obtained, 
synthesized in E. coli and then evaluated for autophosphorylation of the 
EGFR in A431 cells. 

single-cell analysis and isolation system to promptly identify six 
novel EGFR agonists that are active in A431 cells. 

A cell wall is one of the major differences between yeast and 
mammalian cells. The thickness and pore size of the yeast cell wall 
are 50 ~ 200 nm 27 and about 0.89 nm, respectively 28 . Because the 
size of the extracellular domain of the EGFR and the Stokes radius of 
EGF are about 1 1 nm 29 and 3.2 nm 30 , respectively, extracellular pro- 
teinous ligands cannot reach the EGFR buried in the cell wall, and 
ligands expressed in the vicinity of the yeast plasma membrane can 
interact with the EGFR. Thus, each yeast cell concurrently displaying 
the receptor and ligand candidates can work as 'an independent and 
autonomous test tube', allowing us to analyze a ligand-receptor inter- 
action at a single cell level without exogenous interference. By dis- 
playing ligands on the yeast cell surface, molecules accumulate in the 
periplasm and cell wall 31 . Assuming that yeast cells are spheres of a 
5.5 urn average diameter (n = 40), and the average thickness of the 
cell wall is 125 nm 27 , the volume of periplasmic and cell wall fractions 
is calculated as about 1.135 X 10~ n ml. Because the maximal num- 
ber of small molecules displayed on the yeast cell surface is estimated 
to be about 10 6 per yeast cell 32,33 , the maximal concentration of cell 
wall-anchored EGF is estimated at 146 uM, which implies that a 1.2 
X 10 5 -fold of the EC 50 of EGF (1.2 nM) to activate the EGFR in 
mammalian cells 34 . High concentrations of ligands on the yeast cell 
surface facilitate the identification of ligands, even if their agonistic 
activities are very low. 

When yeast cells co-expressing the EGFR and HLH peptide library 
were established, we initially tried to isolate yeast cells with high 
fluorescence by fluorescence-activated cell sorting (FACS). Because 
conventional FACS systems only collect positive cells with a propor- 
tion of more than 0.1% 19,35 , we could not isolate positive clones (up to 
100) from a peptide library of more than 1 X 10 6 by FACS. To 
address this issue, we used an automated single-cell analysis and 
isolation system that manipulates single cells arrayed on a chip with 
250,000 microchambers. Consequently, 83 positive cells were iso- 
lated from about 3.2 X 10 6 independent clones, indicating that the 
automated single-cell analysis and isolation system can manipulate 
positive cells at a proportion of less than 0.0026%. In principle, the 
system can isolate yeast cells with the highest fluorescence from a 
peptide library of more than 10 6 , which is nearly impossible by FACS. 

A random peptide library with a flexible structure often weakly 
interacts with target molecules in an induced-fit manner 36 , leading to 
identification of false-positive molecules. Accordingly, we employed 
a conformationally constrained HLH peptide library for EGFR ago- 
nist screening. DNA sequence analyses revealed several independent 
clones encoding identical amino acid sequences (Supplementary 
Table SI), and about 85% of positive clones (11/13) could activate 
the EGFR in A431 cells, which strongly suggests that the HLH pep- 
tide library is reliable for drug screening. Previous studies with EGF 
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Figure 4 | Single-cell PCR cloning and bacterial expression of HLH 
proteins, (a) Single-cell PCR. PCR products obtained by single-cell PCR 
were subjected to agarose gel electrophoresis. Numbers indicate clones 
amplified successfully (65 clones/83 lanes). M, DNA marker. Asterisks 
indicate the position of target bands, (b) Directional TOPO cloning. The 
cDNA fragments amplified by single cell PCR (216 bp) were extracted 
from agarose gel, and then inserted into pET102 directional TOPO vector 
for the bacterial expression. Loop encodes hepta-glycine. Linker encodes 



N-terminal 20 amino acids of FL042. Asterisks indicate the position of 
identical codon. (c) Purification of HLH proteins. HLH proteins expressed 
as an N-terminal thioredoxin-fused and C-terminal hexahistidine-tagged 
form in Escherichia coli BL21 (DE3) and purified with cobalt-chelating 
column. The purity was checked by CBB staining. Numbers indicate the 
clone number. M, protein marker. Asterisk indicates the position of target 
band. 



mutants have revealed that some amino acid residues in EGF (Tyr- 
13, Leu-15, His-16, Arg-41, Gln-43 and Lue-47) 3738 together with 
Gly-12 39 play a key role for EGFR binding and/or mitogenicity in 
mammalian cells (Supplementary Table S3). Furthermore, by using a 
fluorescence-labeled EGFR extracellular domain as a probe for EGF 
mutants displayed on the yeast cell surface, Cochran et al. 40 and Lahti 
et al. 41 have identified and analyzed various EGFR agonists, in which 
mutant 114, 28 and 123 possess over 30-fold affinity to EGFR com- 
pared with wild-type EGF 40,41 . When comparing the amino acid 
sequences among the previously reported EGF mutants and the 
HLH peptides harboring EGFR agonistic activity (see Supple- 
mentary Table S3), the HLH peptides are fundamentally different 
from these EGF mutants not only in primary structure but also 
secondary structure. Thus, the biological function of each amino acid 
contained in the HLH and EGF mutants could not be completely 
compared. However, while some high affinity EGF mutants have 
been found to contain additional 1 — 4 Pro 40 , this amino acid residue 
was found in the identified HLH peptides at high frequency (4 of 6 
agonistic HLH peptides; i.e., clone 2, 7, 8, 13). Since Pro is known to 
be a-helix breaker, the C-terminal a-helix of HLH peptides was 
presumably somewhat too straight to bind to EGFR structurally. 
On the other hand, HLH clone 4, which contains two Pro, did not 
show the EGFR agonistic activity, suggesting that the C-terminal oc- 
helix of HLH peptides was essentially required to activate EGFR but 
should harbor a shorter or bent helix structure for more efficient 
EGFR activation. All EGFR agonist candidates selected by yeast- 
based screening were considered to possess high activity also in 
mammalian cells. Unexpectedly, two proteins (derived from clones 
1 and 4) showed no agonistic activity, and also 1 1 proteins showed 
low activity in A431 cells. This was probably caused by the differ- 
ences of cellular conditions around EGFR between yeast cells and 
A43 1 cells and/or by insufficient contributions from randomized 5 
amino acid residues located to C-terminal ot-helix of the HLH pep- 
tides to bind to EGFR. If the screening of EGFR agonists with new 
peptide libraries [e.g., HLH scaffold harboring >6 randomized 
amino acid residues, new scaffold optimized for EGFR) is performed, 
new peptides with strong EGFR agonistic activity could be identified. 
Taken together, it is still worth performing the agonist screening in 
yeast cells for identifying lead compounds with novel structure in a 
high-throughput manner. Furthermore, as reported by Fujii et al. ls , 
this library is suitable for extracting pharmacophores to design small 
chemical compounds by molecular dynamics simulation. 

A single cell-based high-throughput drug screening system using 
yeast cell surface engineering to facilitate prompt de novo screening 
of lead compounds for rational drug design may accelerate the intro- 
duction of yeast technology to conventional mammalian cell-based 
drug discovery. 

Methods 

Plasmids. A DNA fragment encoding the yeast oc-factor-prepro region (MFalprepro, 
N-terminal 89 amino acids of the MFal precursor), N-terminal HA-tagged human 
EGF (HA-EGF) and yeast cell wall-anchored peptide FL042 (C-terminal 42 amino 
acids of FLOl) was inserted under the control of the GAL1 promoter into pYES2 
(URA3, Invitrogen, Carlsbad, CA, USA) (pYES2-MFalprepro-HA-EGF-FL042, see 
Figure lb). By replacing £GF-coding region with human IL5 gene, pYES2- 
MFoclprepro-HA-IL5-FL042 was constructed for the expression of non EGFR- 
related ligand. A DNA fragment encoding the SUC2 signal peptide (25 amino acids), 
human EGFR (mature form, Leu-25 to Ala-1210 of the EGFR precursor, GenBank 
accession No. NM_005228.3) and C-terminal V5-tagged EGFR was inserted under 
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Figure 5 | Induction of EGFR autophosphorylation in A431 cells by EGFR agonist candidates, (a) After treatment with 0.1 and 1.0 bacterially 
expressed HLH peptides (8 species, clones 2 — 17) for 5 and 20 min, A431 cell lysates were subjected to western blotting with anti-EGFR and anti-EGFR 
pYl 173 antibodies. As a positive control, 1.5 nM EGF was used, (b) After 30-min treatment with 200 nM tyrphostin AG1478, A431 cells were treated with 
2.0 [^M HLH peptides for 20 min, lysed and subjected to western blotting with anti-EGFR and anti-EGFR pY-1173 antibodies. As a positive control, 
1.5 nM EGF was used. 



the control of the GAL1 promoter into pYES3 (TRP1, Invitrogen) (pYES3-SUC- 
EGFR-V5, see Figure lb). 

As a conformationally constrained peptide library, the DNA fragment encoding 
MFodprepro, an intramolecular anti-parallel HLH peptide with a C-terminal FLAG 
tag and FL042 was inserted under the control of the GAL1 promoter into pYES2 
(pYES2-MFcdprepro-HLH-FLAG-FL042). Briefly, primary PCR was carried out 
using 5'-CGC CAA GCT TGG GCG GAG CTC GCA GCT CTG GAA ATG GAA 
CTG GCG GCA CTC GAA GGT GGC GGC GGT GGT GGC GGC AAG CTG-3' 
and 5'-T CCG CTC GAG TTT ATC ATC ATC ATC TTT ATA ATC GGA GCC 
TCC ACC AGC TTT CAA MNN MNN TAG CTT MNN TTT CAA MNN MNN 
CAG CTT GCC GCC ACC ACC-3', in which N represents A, T, G or C, and M 
represents A or C. The DNA fragment encoding randomized HLH-FLAG moiety was 
further amplified by using 5'-CGC CAA GCT TGG GCG GAG- 3' and 5'-TCC GCT 
CGA GTT TAT CAT CAT CAT CTT TAT-3', treated with Hind III and Xho I, and 
then inserted into pYES2-MFalprepro-FL042 vector. The HLH peptide contained 
randomized sequences of five amino acids on the solvent- accessible surface of the 
second helix. 

To examine homo-oligomerization of the EGFR, the DNA fragment of the human 
EGFR (mature form) was inserted under the control of the ADH1 promoter of pPR3- 
SUC (TRP1, Mo Bi Tech, Gottingen, Germany) to express an N-terminal SUC2 signal 
peptide-fused and C-terminal Nub (N-terminal half of human ub)-fused form (pPR3- 
SUC-EGFR), and the same DNA fragment was inserted under the control of the CYC1 
promoter into pBT3-SUC (LEU2, Mo Bi Tech) to express an N-terminal SUC2 signal 
peptide-fused and C-terminal Cub (C-terminal half of human ub)-LexA (DNA- 
binding domain)-VP16 (transactivating domain) -fused form (pBT3-SUC-EGFR). 

To examine interactions of the EGFR with Grb2 and Shcl, the DNA fragment 
encoding MFalprepro and N-terminal FLAG-tagged EGFR was inserted under the 
control of the GAL1 promoter of pGMH20 (HIS3, RIKEN Bioresource Center, 
Ibaraki, Japan) (pGMH20-MFalprepro-FLAG-EGFR). Human Grb2 cDNA 
(NM_002086.4) and human Shcl cDNA (NM_183001.3) were inserted under the 
control of the ADH1 promoter of the pSos plasmid (Stratagene, La Jolla, CA, USA) to 
express an N-terminal Sos (N-terminal 1063 amino acids of Sos)-fused form of Grb2 
(pSos-Grb2) and Shcl (pSos-Shcl), respectively. 

Yeast strains. S. cerevisiae BJ5464 (MATot ura3-52 his3-200 trpl leu2-l pep4::HIS3 
prbl A1.6R canl Gal + ) was obtained from the Yeast Genetic Resource Center (Osaka, 
Japan). 5. cerevisiae cdc25h (MATa ura3-52 his3-200 trpl-901 leu2-3,112 ade2-101 
lys2-801 cdc25-2 Gal + ) and NMY51 (MATa his3-200 trpl-901 leu2-3,112 ade2 
ura3::(lexAop) g -lacZ LYS2::(lexAop) 4 -HIS3 (lexAop) 8 -ADE2 GAL4) were purchased 
from Stratagene and Mo Bi Tech, respectively. Manipulation of yeast cells was 
performed according to the yeast protocol handbook (Clontech). 



Western blotting. Yeast cells were incubated with SD medium containing 20 mM 
HEPES (pH 7.0) for 1 h to allow interaction between EGF and the EGFR on the yeast 
cell surface. When using tyrphostin AG1478 (W-(3-chlorophenyl)-6,7-dimethoxy-4- 
quinazolinamine) (Cell Signaling Technology, Beverly, MA, USA), cells were treated 
with the reagent at 1 uM for 30 min before lysis. Yeast cells were suspended in a lysis 
buffer (50 mM Tris-HCl, pH 7.2, 150 mM NaCl, 1 mM EGTA, 1% (v/v) Triton X- 
100, 2 mM dithiothreitol and 1 tablet per 10 ml Complete 1M protease inhibitor 
cocktail (Roche, Basel, Switzerland), frozen in liquid nitrogen and then disrupted with 
vigorous shaking in an automatic homogenizer (Automill 1M ; Tokken, Chiba, Japan). 
After centrifugation, the supernatant was collected as the cytosolic fraction. The cell- 
wall fraction was extracted from cell debris with 0.5% (w/v) SDS and 1% (v/v) 2- 
mercaptomethanol 42 . Each fraction was subjected to SDS-PAGE followed by western 
blotting with a horseradish peroxidase (HRP)-conjugated anti-HA antibody (clone 
12CA5, Roche), an HRP-conjugated anti-V5 antibody (clone V5-10, Sigma- Aldrich, 
St Louis, MO, USA), anti-EGFR pY-1068 mouse monoclonal antibody (clone 15A2, 
nanoTools, Teningen, Germany), anti-EGFR pY-1148 rabbit polyclonal antibody 
(Novus Biologicals, Littleton, CO, USA), anti-EGFR pY-1 173 goat polyclonal 
antibody (Santa Cruz Biotechnology, Santa Cruz, CA, USA) and appropriate HRP- 
conjugated secondary antibodies (anti-mouse IgG (GE Healthcare, St. Giles), anti- 
rabbit IgG (GE Healthcare), anti-goat IgG (Santa Cruz)). Immunoreactive bands were 
visualized with an ECL Plus Western Blotting Detection System (GE Healthcare). 

Immunocytochemical analysis. Yeast cells were incubated with SD medium 
containing 20 mM HEPES (pH 7.0) for 1 h to allow interaction between EGF and the 
EGFR on the yeast cell surface. Cells were incubated in fixation buffer (40 mM 
potassium phosphate, pH 7.2, 4% (w/v) paraformaldehyde (PFA) at room 
temperature for 30 min, treated with 12 U/ml zymolyase 100 T (Seikagaku, Tokyo, 
Japan) in sorbitol buffer (40 mM potassium phosphate, pH 7.2, 1.2 M sorbitol and 
0.5 mM MgCl 2 ) to produce spheroplasts, incubated in blocking buffer (10 mM Tris- 
HCl, pH 7.2, 150 mM NaCl and 2% (w/v) bovine serum albumin (BSA) and then 
labeled with appropriate primary antibodies at room temperature for 1 h. To detect 
HA-EGF-FL042 and HA-IL5-FL042, intact cells were used instead of spheroplasts. 
Labeled cells were washed with TBS (10 mM Tris-HCl, pH 7.2 and 150 mM NaCl), 
incubated in blocking buffer containing Alexa Fluor 488-labeled anti-IgG (Invitrogen) 
at room temperature for 30 min, washed with TBS and then analyzed with a 
FLUOVIEW FV1000 confocal laser scanning microscope (Olympus, Tokyo, Japan) 
and FACSCanto IM II flow cytometer (BD Biosciences, San Jose, CA, USA). Antibodies 
used were as follows: anti-HA (clone 12CA5), anti-V5 (clone V5005, Nacalai, Kyoto, 
Japan), anti-EGFR phosphotyrosine-1068 (pY-1068) (clone 15A2, nanoTools, 
Teningen, Germany), anti-EGFR pY-1148 (Novus Biologicals, Littleton, CO, USA) 
and anti-EGFR pY-1173 (Santa Cruz Biotechnology, Santa Cruz, CA, USA). 
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Split-ub membrane two-hybrid assay. S. cerevisiae NMY51 harboring pPR3-SUC- 
EGFR (pray), pBT3-SUC-EGFR (bait) and either pYES2-MFalprepro-HA-EGF- 
FL042 (ligand) or pYES2-MFcdprepro-FL042 (control) were plated on a His- 
depleted SD plate or an X-Gal- containing SD plate and incubated at 30°C for 8 days. 

Complementation assay with the CDC25 ts strain. S. cerevisiae cdc25h harboring 
pGMH20-MFotlprepro-FLAG-EGFR, either pSos-Grb2 or pSos-Shcl and either 
pYES2-MFalprepro-HA-EGF-FL042 (ligand) or pYES2-MFotlprepro-FL042 
(control) were plated on an SD plate and incubated at 25 or 37°C for 7 days. 

De novo agonist screening with an automated single-cell analysis and isolation 
system. S. cerevisiae BJ5464 cells harboring pYES2-MFalprepro-HLH-FLAG- 
FL042 and pYES3-SUC-EGFR-V5 were incubated with SD medium containing 
20 mM HEPES (pH 7.0) for 1 h, fixed with PFA, treated with zymolyase, blocked with 
BSA and then stained with an anti-EGFR pYl 173 antibody. Approximately 2.5 X 10 5 
fixed spheroplasts in 1 ml PBS were individually placed into the microchambers 
(10 um diameter, 30 urn well-to-well pitch and 10 u.m well depth) of a cell array chip 
(256,000 wells; 20 X 20 wells/subarea; 20 X 32 subareas/chip on a 1.39 X 2.23 cm 2 
chip) by brief centrifugation (7 X g at room temperature for 1 min, three times) and 
then analyzed with the automated single-cell analysis and isolation system (As One 
corp., Osaka, Japan). Each cell with high fluorescence was collected by the automated 
single-cell analysis and isolation system in descending order of fluorescent intensity 
as described previously 19 . Briefly, microchambers containing no or more than 2 
fluorescent spheroplasts were excluded from further analyses. A histogram together 
with a list of correlations between the position and fluorescent intensity of each 
spheroplast was generated. Target spheroplasts could be virtually marked in 
descending order of fluorescent intensity. Spheroplasts of interest were automatically 
collected with a glass capillary attached to the micromanipulator of the automated 
single-cell analysis and isolation system, which were optimized not to pick up the 
adjacent non-target spheroplasts and confirmed by elimination of fluorescence in the 
target microchamber automatically. Upon failure, the system automatically repeated 
the collection process. Each spheroplast was transferred and released into the 
assigned well in 96-well PCR plates. The reciprocal movement of the glass capillary 
required 15 s for each spheroplast. After repeating the procedure 20 times, 83 
spheroplasts as positive clones isolated from about 3.2 X 10 6 independent clones were 
moved to a 96 well PCR plate and then subjected to single cell-based PCR. Primary 
PCR was carried out using 5'-GCC TTA TTT CTG GGG TAA TTA ATC AGC G-3', 
5'-ACC TAG ACT TCA GGT TGT CTA ACT CCT TCC-3' and KOD-Plus- 
(TOYOBO, Osaka, Japan) to amplify the DNA fragment encoding the GAL1 
promoter, MFodprepro-HLH-FLAG-FL042 and CYC1 terminator. The thermal 
cycling of the primary PCR was pre-heating at 94°C for 5 min; 12 cycles of denaturing 
at 94°C for 1 min; annealing from 73 C C to 61°C (-l°C/cycle) for 20 sec; extension at 
68°C for 90 sec; followed by 45 cycles of denaturing at 94°C for 1 min; annealing at 
64 C for 30 sec; extension at 68°C for 90 sec and final extension at 68 C C for 7 min. 
The secondary PCR was carried out using 5' -CAC CCA AGC TTG GGC GGA GCT 
CGC AGC TCT-3', 5'-GGC ACT GCC AGC ATA CGT TGA AAT-3' and KOD- 
Plus- to amplify the DNA fragment encoding HLH-FLAG and the N-terminal 20 
amino acids of FL042. The thermal cycling of the secondary PCR was pre-heating at 
94°C for 2 min; 12 cycles of denaturing at 94"C for 1 min; annealing from 77°C to 
63°C (-l°C/cycle) for 20 sec; extension at 68°C for 15 sec; followed by 45 cycles of 
denaturing at 94°C for 1 min; annealing at 66 J C for 30 sec; extension at 68 C for 
15 sec and final extension at 68 C for 7 min. The amplified DNA fragments were 
inserted under the control of the T7 promoter in the pET102/D-TOPO vector 
(Invitrogen) (pET102-HLH-FLAG). 

Bacterial expression of EGFR agonist candidates. E. coli BL21 (DE3) harboring 
pET 102 -HLH-FLAG was cultured in an Overnight Express Autoinduction System 
(Merck, Darmstadt, Germany). N-terminal thioredoxin (TRX)-fused and C-terminal 
His6-tagged forms of HLH-FLAG were purified by the BD TALON™ HT 96- Well 
Purification System (BD) according to the manufacture's protocol. Homogeneity of 
TRX- HLH-FLAG -His 6 protein was confirmed by SDS-PAGE followed by Coomassie 
brilliant blue R-250 staining. 

Autophosphorylation assay in A431 cells. Human squamous carcinoma A43 1 cells 
(approximately 5 X 10 4 cells) cultured on a 3.5 cm dish were incubated in serum-free 
medium for 16 h and then treated with 0.1, 1.0, or 2.0 uM TRX-HLH-FLAG-His6 
proteins for 5 and 20 min. As a positive control, 1.5 nM recombinant human EGF 
(PEPROTECH, Rocky Hill, NJ, USA) was used. When using tyrphostin AG1478, cells 
were treated with the reagent at 200 nM for 30 min before ligand {i.e., EGF, HLH) 
treatment. Cells were lysed with lysis buffer (50 mM Tris-HCl, pH 7.2, 150 mM 
NaCl, 1 mM EGTA, 1% Triton X-100, 1 mM Na 3 V0 4> 1% (v/v) phosphatase 
inhibitor cocktail I (SIGMA) and 1 tablet/50 ml Complete™ (Roche), mixed with 1 X 
Laemmli's sample buffer and then boiled for 15 min. Samples were analyzed by 
western blotting using anti-EGFR (Rockland, Gilbertsville, PA, USA) and anti-EGFR 
pY-1173 antibodies. 
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